NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Control Theoretic Approach to Fine-Tuning and Transfer Learning

Bayram, Erkan; Liu, Shenyu; Belabbas, Mohamed-Ali; Basar, Tamer (October 2025, Lecture notes in control and information sciences proceedings)

Full Text Available
Analysis, State Estimation, and Control for the Networked Competitive Multi-Virus SIR Model

https://doi.org/10.1016/j.automatica.2025.112479

Zhang, Ciyuan; Gracy, Sebin; Basar, Tamer; Paré, Philip E (October 2025, Automatica)

Full Text Available
Quantization Enabled Privacy Protection in Decentralized Stochastic Optimization

https://doi.org/10.1109/TAC.2022.3198030

Wang, Yongqiang; Basar, Tamer (July 2023, IEEE Transactions on Automatic Control)

Full Text Available
A Reinforcement Learning Look at Risk-Sensitive Linear Quadratic Gaussian Control

Cui, Leilei; Basar, Tamer Basar; Jiang, Zhong-Ping (August 2023, Proceedings of Machine Learning Research)
N. Matni, M. Morari (Ed.)
In this paper, we propose a robust reinforcement learning method for a class of linear discrete-time systems to handle model mismatches that may be induced by sim-to-real gap. Under the formulation of risk-sensitive linear quadratic Gaussian control, a dual-loop policy optimization algorithm is proposed to iteratively approximate the robust and optimal controller. The convergence and robustness of the dual-loop policy optimization algorithm are rigorously analyzed. It is shown that the dual-loop policy optimization algorithm uniformly converges to the optimal solution. In addition, by invoking the concept of small-disturbance input-to-state stability, it is guaranteed that the dual-loop policy optimization algorithm still converges to a neighborhood of the optimal solution when the algorithm is subject to a sufficiently small disturbance at each step. When the system matrices are unknown, a learning-based off-policy policy optimization algorithm is proposed for the same class of linear systems with additive Gaussian noise. The numerical simulation is implemented to demonstrate the efficacy of the proposed algorithm.
more » « less
Full Text Available
A Reinforcement Learning Look at Risk-Sensitive Linear Quadratic Gaussian Control

Cui, Leilei; Basar, Tamer; Jiang, Zhong-Ping (June 2023, Proceedings of Machine Learning Research)
Matni, N; Morari, M; Pappas, G J (Ed.)
In this paper, we propose a robust reinforcement learning method for a class of linear discrete-time systems to handle model mismatches that may be induced by sim-to-real gap. Under the formulation of risk-sensitive linear quadratic Gaussian control, a dual-loop policy optimization algorithm is proposed to iteratively approximate the robust and optimal controller. The convergence and robustness of the dual-loop policy optimization algorithm are rigorously analyzed. It is shown that the dual-loop policy optimization algorithm uniformly converges to the optimal solution. In addition, by invoking the concept of small-disturbance input-to-state stability, it is guaranteed that the dual-loop policy optimization algorithm still converges to a neighborhood of the optimal solution when the algorithm is subject to a sufficiently small disturbance at each step. When the system matrices are unknown, a learning-based off-policy policy optimization algorithm is proposed for the same class of linear systems with additive Gaussian noise. The numerical simulation is implemented to demonstrate the efficacy of the proposed algorithm.
more » « less
Full Text Available
Gradient-tracking based Distributed Optimization with Guaranteed Optimality under Noisy Information Sharing

https://doi.org/10.1109/TAC.2022.3212006

Wang, Yongqiang; Basar, Tamer (October 2022, IEEE Transactions on Automatic Control)

Full Text Available
The Confluence of Networks, Games, and Learning a Game-Theoretic Framework for Multiagent Decision Making Over Networks

https://doi.org/10.1109/MCS.2022.3171478

Li, Tao; Peng, Guanze; Zhu, Quanyan; Basar, Tamer (August 2022, IEEE Control Systems)

Full Text Available
The H-property of Line Graphons

https://doi.org/10.23919/ASCC56756.2022.9828212

Belabbas, Mohamed-Ali; Chen, Xudong; Basar, Tamer (May 2022, 2022 13th Asian Control Conference)

Full Text Available
Fixed-Time Nash Equilibrium Seeking in Time-Varying Networks

https://doi.org/10.1109/TAC.2022.3168527

Poveda, Jorge I.; Krstic, Miroslav; Basar, Tamer (April 2022, IEEE Transactions on Automatic Control)

Full Text Available
Convergence and optimality of policy gradient primal-dual method for constrained Markov decision processes

https://doi.org/10.23919/ACC53348.2022.9867805

Ding, Dongsheng; Zhang, Kaiqing; Basar, Tamer; Jovanovic, Mihailo R. (June 2022, 2022 American Control Conference)

Full Text Available

« Prev Next »

Search for: All records